Reconstruction of Ancestral Gene Order after Segmental Duplication and Gene Loss

Authors

  • Jun Huan
  • Jan Prins
  • Wei Wang
  • Todd J. Vision
Abstract

As gene order evolves through a variety of chromosomal rearrangements, conserved segments provide important insight into evolutionary relationships and functional roles of genes. However, gene loss within otherwise conserved segments, as typically occurs following large-scale genome duplication, has received limited algorithmic study. This has been a major impediment to comparative genomics in certain taxa, such as plants and fish. We propose a heuristic algorithm for the inference of ancestral gene order in a set of related genomes that have undergone large-scale duplication and gene loss. First, approximately conserved (i.e. homologous) segments are identified using pairwise local genome alignment. Second, homologous segments are iteratively clustered under the control of two parameters, (1) the minimal required number of shared genes between two clusters and (2) the maximal allowed number of rearrangement breakpoints along the lineage leading to each descendant segment. Finally, we compute an estimated ancestral gene order for each cluster that is optimal in some sense. We evaluate the performance of this algorithm on simulated data that models a genome evolving by large-scale duplication, duplicate gene loss, transposition, translocation, and inversion. The results suggest that long segments of ancestral gene order may be reconstructed following moderate levels of rearrangement with only minor loss of accuracy.
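The clustering step can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify how shared genes or breakpoints are counted, nor how an ancestral order is computed, so everything below is an assumption made for illustration. Segments are modeled as lists of unsigned gene identifiers with no repeats, breakpoints are counted pairwise between two segments (restricted to their shared genes) rather than along a lineage from an inferred ancestor, and clusters are formed by simple single-linkage merging. The names `shared_genes`, `count_breakpoints`, `can_merge`, and `cluster_segments` are hypothetical.

```python
from itertools import combinations


def shared_genes(a, b):
    """Return the set of genes present in both segments."""
    return set(a) & set(b)


def count_breakpoints(a, b):
    """Count adjacencies of `a` that are not adjacencies of `b`.

    Both segments are first restricted to their shared genes, so a gene
    lost from one segment does not by itself create a breakpoint.
    Orientation is ignored (unsigned genes): an assumption of this sketch.
    """
    common = shared_genes(a, b)
    ra = [g for g in a if g in common]
    rb = [g for g in b if g in common]
    adjacencies_b = {frozenset(pair) for pair in zip(rb, rb[1:])}
    return sum(1 for pair in zip(ra, ra[1:])
               if frozenset(pair) not in adjacencies_b)


def can_merge(a, b, min_shared, max_breakpoints):
    """Apply the two thresholds: enough shared genes, few enough breakpoints."""
    return (len(shared_genes(a, b)) >= min_shared
            and count_breakpoints(a, b) <= max_breakpoints)


def cluster_segments(segments, min_shared=3, max_breakpoints=1):
    """Single-linkage clustering of segments; returns lists of segment indices."""
    parent = list(range(len(segments)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(segments)), 2):
        if can_merge(segments[i], segments[j], min_shared, max_breakpoints):
            parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(segments)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy data: segment 1 is segment 0 with gene "b" lost; segment 2 is unrelated.
segments = [
    ["a", "b", "c", "d", "e"],
    ["a", "c", "d", "e"],
    ["x", "y", "z", "a"],
]
clusters = cluster_segments(segments, min_shared=3, max_breakpoints=1)
```

In this toy input, the first two segments share four genes in the same relative order and are grouped together, while the third shares only one gene and stays separate. Raising `min_shared` or lowering `max_breakpoints` makes the clustering more conservative.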


Similar resources

Gene Family: Structure, Organization and Evolution

  Gene families are groups of homologous genes that share very similar sequences and may have identical functions. Members of gene families may be found in tandem repeats or interspersed through the genome. These sequences are copies of the ancestral genes that have undergone changes. The multiple copies of each gene in a family were constructed based on gene duplicati...


Reconstruction of Ancestral Gene Order Following Large Scale Genome Duplication and Gene Loss

Gene order evolves through gross chromosomal rearrangements, small scale inversions and transpositions, gene duplication, and gene loss. Much research has been done on the calculation of edit distance and on sorting algorithms under a variety of rearrangement models in which the genome may be represented as conserved segments with permuted order and orientation. However, gene loss within otherw...


Sporadic Gene Loss After Duplication Is Associated with Functional Divergence of Sirtuin Deacetylases Among Candida Yeast Species

Gene duplication promotes the diversification of protein functions in several ways. Ancestral functions can be partitioned between the paralogs, or a new function can arise in one paralog. These processes are generally viewed as unidirectional. However, paralogous proteins often retain related functions and can substitute for one another. Moreover, in the event of gene loss, the remaining paral...


Supplemental Section S1 – Genome

Supplemental Section S1 – Genome Sequencing and Assembly
Supplemental Section S2 – Indel Assessment With the Neutral Indel Model
Supplemental Section S3 – Great Ape Divergence Estimate via WGS Read Mapping
Supplemental Section S4 – Short Read Sequencing
Suppl...


Inferring Ancestral Gene Orders for a Family of Tandemly Arrayed Genes

Tandemly arrayed genes (TAG) constitute a large fraction of most genomes and play important biological roles. They evolve through unequal recombination, which places duplicated genes next to the original ones (tandem duplications). Many algorithms have been proposed to infer a tandem duplication history for a TAG cluster. However, the presence of different transcriptional orientations in many c...



Publication date: 2003